Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Scene graph-aware cross-modal image captioning model
Zhiping ZHU, Yan YANG, Jie WANG
Journal of Computer Applications    2024, 44 (1): 58-64.   DOI: 10.11772/j.issn.1001-9081.2022071109
Abstract278)   HTML7)    PDF (1879KB)(183)       Save

Aiming at the forgetting and underutilization of the text information of image in image captioning methods, a Scene Graph-aware Cross-modal Network (SGC-Net) was proposed. Firstly, the scene graph was utilized as the image’s visual features, and the Graph Convolutional Network (GCN) was utilized for feature fusion, so that the visual and textual features were in the same feature space. Then, the text sequence generated by the model was stored, and the corresponding position information was added as the textual features of the image, so as to solve the problem of text feature loss brought by the single-layer Long Short-Term Memory (LSTM) Network. Finally, to address the issue of over dependence on image information and underuse of text information, the self-attention mechanism was utilized to extract significant image information and text information and fuse then. Experimental results on Flickr30K and MS-COCO (MicroSoft Common Objects in COntext) datasets demonstrate that SGC-Net outperforms Sub-GC on the indicators BLEU1 (BiLingual Evaluation Understudy with 1-gram), BLEU4 (BiLingual Evaluation Understudy with 4-grams), METEOR (Metric for Evaluation of Translation with Explicit ORdering), ROUGE (Recall-Oriented Understudy for Gisting Evaluation) and SPICE (Semantic Propositional Image Caption Evaluation) with the improvements of 1.1,0.9,0.3,0.7,0.4 and 0.3, 0.1, 0.3, 0.5, 0.6, respectively. It can be seen that the method used by SGC-Net can increase the model’s image captioning performance and the fluency of the generated description effectively.

Table and Figures | Reference | Related Articles | Metrics
Node identity authentication scheme for clustered WSNs based on P-ECC and congruence equation
ZHOU Zhiping ZHUANG Xuebo
Journal of Computer Applications    2014, 34 (1): 104-107.   DOI: 10.11772/j.issn.1001-9081.2014.01.0104
Abstract602)      PDF (675KB)(434)       Save
Concerning the problems of large node memory occupation, complex calculation, low information safety degree, in the legal identity authentication when new node joins in sensor networks, a node authentication mechanism of highly safety degree applicable to the limited memory network was proposed. The mechanism used the password to add the node itself, and one-way Hash function was applied to the password and IDentity (ID) for hashing. Password was involved in the generation of the elliptic curve signature algorithm and authentication scheme of congruence equation was adopted between credible nodes. Each certification stage used mutual authentication mode. The proposed algorithm not only can prevent eavesdropping, replay, injection and so on, but also is able to resist guessing attack, mediation attack, anonymous attack and denial of service attack. The comparison with the existing algorithms show that the proposed scheme can reduce the node original memory occupation of three unit level and can reduce key detection rate.
Related Articles | Metrics